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Let  F(x)  * f^x  + f^x  + ...  be  a formal  power  series  over  a field  A. 

Let  F^(x)  » x and  for  q - 1,2,...,  define  F^(x)  - F^’ 1 ^ (F  (x) ) . The 
obvious  algorithm  for  computing  the  first  n terms  of  F^(x)  is  by  the  com- 
position analogue  of  repeated  squaring.  This  algorithm  has  complexity  about 
log^  q times  that  of  a single  composition.  Brent  showed  that  the  factor  log,,  q 
can  be  eliminated  in  the  computation  of  the  first  n terms  of  (F(x))^  by  a 
change  of  representation,  using  the  logarithm  and  exponential  functions.  We 
show  the  factor  log2  q can  also  be  eliminated  for  the  composition  problem. 

F^^(x)  can  often,  but  not  always,  be  defined  for  more  general  q. 

We  give  algorithms  and  complexity  bounds  for  computing  the  first  n terms  of 

r a 1 

Flmj(x)  whenever  it  is  defined. 

We  conclude  the  paper  with  some  open  problems. 

Keywords 

Composition,  fast  algorithms,  formal  power  series,  symbolic  computation, 
generalized  composition,  functional  equations,  Schroeder  function,  iteration, 
similarity  transformations. 
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1.1 


1.  INTRODUCTION 

Let 

(1.1)  F (x)  « fjX  + f2x2  + ... 

be  a formal  power  series  over  a field  A.  Let  F^^(x)  ■ x and  for  q ■ 1,2,..., 
define  the  q-composite  of  F by 

(1.2)  Ftq](x)  - F^-1](F(x)) . 

Let  H(x)  be  the  reversion  of  F(x),  i.e.,  the  power  series  inverse  to  F(x) 
under  composition.  For  q * 1,2,...,  define 

(1.3)  F^”q^ (x)  - H^q ^ (x) . 

As  we  shall  see  below,  the  q-composite  of  F can  often  (but  not  always) 
be  defined  for  more  general  q.  If  q is  not  an  integer,  we  shall  call  F^(x) 
a generalized  q-composite.  We  confine  ourselves  to  the  case  that  F^q^(x) 
is  a power  series.  One  important  special  case  of  generalized  composition  is 
q ■ l/r,  where  r is  an  integer.  Then  G " F^r^(x)  is  an  rth  root  of  F 
under  composition,  and  satisfies  the  equation  G^r"(x)  ■ F(x). 

Let 

(1.4)  F (x)  • f x + ...  + f x“, 

n i n 

(1.5)  G(x)  - F^(x)  ■ g^  + g2x2  + .... 

(1.6)  Gq(x)  - gxx  + ...  + gn*n* 

Given  q and  FQ(x),  we  want  to  compute  GQ(x). 


T 
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In  this  paper  we  shall  give  algorithms  and  complexity  bounds  for  com- 

puting  G (x)  whenever  it  is  defined.  For  integer  q these  algorithms  are 
n 

asymptotically  faster  than  the  obvious  algorithms. 

We  discuss  the  last  point.  Let  COMP^(n)  denote  the  complexity  of  com- 
puting the  first  n terms  of  F(F(x)),  and  let  q be  a power  of  two.  Then  the 
obvious  algorithm  for  computing  Gn(x)  is  by  the  composition  analogue  of 
"repeated  squaring",  and  has  complexity  COMP^(n)lg  q.  (We  shall  denote 
log0  by  lg.)  Can  we  eliminate  the  multiplicative  factor  of  lg  q? 

An  analogous  problem  is  that  of  computing  Rn(x) , the  first  n terms  of 
(F(x))q.  Asymptotically  in  n,  the  complexity  of  forming  R-n(x)  is  the  same 
as  the  complexity  of  a single  multiplication  of  two  polynomials  of  degree  n. 
This  follows  from  the  observation  that  if  A(x)  is  a power  series  with  constant 
term  unity,  then  (A(x))q  s exp(q  In  A(x)).  This  may  be  viewed  as  a change  of 
representation  of  A(x)  to  a new  representation  where  multiplication  is  replaced 
by  addition,  followed  by  the  inverse  change  of  representation.  Brent  [76] 
showed  that  the  change  of  representation  could  be  computed  "fast". 

This  suggests  asking  whether  there  is  a change  of  representation  which 
reduces  composition  to  multiplication.  We  shall  see  that  there  is,  at  least  in 
the  "regular"  case  (see  Section  3).  Furthermore,  the  change  of  representation 
can  be  computed  "fast".  This  enables  us  to  eliminate  the  multiplicative  factor 
of  lg  q.  In  addition  we  shall  show  (Sections  4-6)  that  even  In  the  "non- 
regular" cases  we  can  still  eliminate  this  factor.  A bonus  is  that  our  algo- 
rithms apply  for  non-integer  q (so  long  as  F^(x)  is  a well-defined  power 
series) . 

The  problem  of  composition  and  generalized  composition  occurs  In  many 
applications  including  asymptotic  analysis,  difference  equations,  numerical 
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analysis,  and  dynamical  systems.  See,  for  example,  Aczel  [66],  Cherry  [64], 
de  Bruijn  [70],  Henrici  [74],  Knuth  [69],  Kuczma  [68],  Levy  and  Lessman 
[61], and  Melzak  [73].  The  study  of  composition  (often  called  iteration) 
may  be  viewed  as  a major  sub-field  of  mathematics.  See  Aczel  [66],  Gross 
[72],  and  Kuczma  [68]  for  very  extensive  bibliographies.  However,  little 
attention  seems  to  have  been  given  to  the  development  of  algorithms  for  com- 
puting F^^(x)  when  F (x)  is  a given  power  series. 

The  following  conventions  are  adopted  below.  We  deal  with  formal  power 
series;  that  is,  we  do  not  concern  ourselves  with  convergence.  Power  series  are 
denoted  by  upper  case  letters  such  as  A(x)  or  simply  A,  with  coefficients 

denoted  by  the  corresponding  lower  case  letters  such  as  a^.  If 
k k+1 

A(x)  ■ <^x  + a^^x  + ^ 0,then  ord(A)  m k.  It  is  convenient  to 

define  ord(O)  * ®.  If  ord(B-C)  2 k we  write  B * C + O(x^) . The  polynomial 

k- 1 k 

bg  + bjX  + ...  + b^jX  is  denoted  either  by  B(x)  mod  x or  by  B^  .l(x) • 

It  is  convenient  to  define  y(n,q)  - 0(6(n,q))  to  mean 
jY(n,q)|  sK|6(n,q)|  for  all  sufficiently  large  integer  n and  for  all  q 
under  consideration. 

We  summarize  the  remainder  of  the  paper.  Our  complexity  model  is 
specified  in  Section  2.  In  Section  3 we  study  the  "regular"  case  when  the 
multiplier  is  such  that  f^  ^ 0,  f1^  ^ 1,  m « 1,2,...  . In  the  following 
three  sections  we  consider  the  cases  f ^ ■■  0 ; f^  ■ 1;  f^  * 1,  integer  m > 1, 
but  f 1,  respectively. 

In  each  of  Sections  3,  4,  and  5 we  define  an  "auxiliary"  function,  demon- 
strate it  can  be  computed  fast  by  "divide  and  conquer",  and  show  how  it  can 
be  used  to  compute  F^^.  The  case  studied  in  Section  6 can  be  reduced  to 
that  of  Section  5.  In  the  concluding  section  we  state  a theorem  (Theorem  7.1) 
sunnarizing  our  results,  state  the  defining  aquations  for  all  cases,  and  mention 


some  open  problems. 
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2.  COMPLEXITY  MODEL 

In  this  section  we  state  our  complexity  model  and  summarize  the  complexity 
results  needed  below.  We  assume  that  scalar  arithmetic  operations  are  performed 
exactly  and  have  unit  cost.  Thus  our  time  bounds  are  invalid  if,  for  example, 
exact  rational  arithmetic  is  used.  However,  our  algorithms  should  still  be 
useful  in  this  case. 

Given  power  series  A(x)  and  B(x) , the  time  required  to  compute  A(x)B(x)  mod  x 
is  denoted  by  MULT(n) . If  ord(B)  2 1,  the  time  required  to  compute  A(B(x))  mod  x” 
is  denoted  by  COMP(n) . We  assume  that  MULT(n)  and  COMP(n)  satisfy  certain 
plausible  regularity  conditions  (see  Brent  and  Rung  [76,  Section  1]).  Then 
Brent  and  Rung  [76]  show 

(2.1)  COMP (n)  - 0(min(n(1+r)//2,(n  lg  n) ^ 2MULT (n) ) ) , 

if  matrix  multiplication  has  complexity  0(nr) . If  the  field  A is  such  that 
fast  algorithms  like  the  FFT  are  available,  then 

(2.2)  MULT (n)  - 0(n  lg  n) 

(see  Borodin  and  Munro  [75]),  and  it  follows  from  (2.1)  that 

(2.3)  COMP(n)  - 0((n  lg  n)3/2). 

The  bounds  in  this  paper  will  be  expressed  in  terms  of  the  complexity 
function 

Ug  nj 

(2.4)  COMP2(n)  - £ 2J  C0MP(r2"jnl). 

J-o 
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Assume 

(2.5)  COMP(n)  ~nas(n), 

where  a 2 1,  and  s (n)  is  a monotonic  increasing  positive  function.  For 
example,  s(n)  might  be  (lg  n)  p for  some  constant  3*0.  Then 


(2.6)  C0MP2(n) 


0(C0MP(n) ) , if  or  > 1 

0(C0MP(n)lg  n)  , if  or  - 1 


If  the  field  A is  such  that  (2.3)  holds,  then  a £ 3/2.  If  this  is  best  pos- 
sible, then  C0MP2(n)  may  be  replaced  by  0(C0MP(n))  in  our  bounds. 

If  or  • 1 and  q is  a fixed  integer,  then  "repeated  squaring"  is  asymp- 
totically faster  than  our  algorithms.  Of  course,  if  q is  not  an  intiger, 
then  "repeated  squaring"  is  not  an  alternative  to  our  algorithms.  If  a > 1, 
our  result  (that  we  can  eliminate  the  multiplicative  factor  of  lg  q)  holds 
for  all  fields  of  characteristic  zero  and  all  finite  fields  of  characteristic 
p greater  than  n. 

q q 

If  f^  is  defined,  we  denote  the  complexity  of  computing  f^  by  POWER (q) . 

If  q is  a positive  integer,  then  POWER(q)  ■ 0(lg  q) . 

In  Brent  [76]  it  is  shown  that  the  complexity  of  computing  ln(l+A(x) )mod  x° 

is  0(MULT(n))  for  any  power  series  A,  ord(A)  > 0.  Using  3rent' s results  it 

Q n 

can  be  shown  that  the  complexity  of  computing  (B(x))  mod  x is 

0(MULT(n)  + POWER (q ) ) . By  Brent  and  Rung  [76,  Lemma  4.2]  MULT (n)«0 (COMP (n) ) , 

so  we  can  absorb  MULT (n)  into  COMP(n)  in  our  analyses. 


2.3 


Recall  that  COMP^n)  was  defined  as  the  complexity  of  computing  the 
first  n terms  of  F (F (x) ) . It  can  be  shown,  by  means  similar  to  the  proof 
of  Brent  and  Kung  [76]  that  the  complexity  of  reversion  and  composition 
are  asymptotically  equal,  that  CCMP(n)  - CKCOMP^n)). 
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3.1 


3.  THE  REGULAR  CASE 

In  this  section  we  study  the  computation  of  F^(x)  when  0,  f“  4 1, 

m ■ 1,2,...  . We  call  this  the  regular  case.  Define  the  Schroeder  function 
S(x)  by 

(3.1)  S (F (x) ) - f1S(x),  ord(S)  - 1,  8j-l. 

S(x)  exists  and  is  unique  (Schroeder  [1871],  Kuczma 

[68,  Chapter  6]).  See  also  Parker  [77],  It  is  easy  to  prove  that,  for  all 
integer  q, 

(3.2)  F[q](x)  - ^(fj  S(x) ) . 

S(x)  and  S^"^(x)  play  the  role  that  the  logarithm  and  exponential  functions 
play  in  computing  ,'F(x))q  fast.  They  reduce  self-composition  to  scalar 
powering.  Note  also  the  analogy  with  diagonalizing  a matrix  by  a similarity 
transformation. 

q 

If  q is  not  an  integer  but  q and  the  scalar  f^  are  such  that  f^  is 
defined,  then  (3.2)  may  be  used  to  define  We  shall  use  the  "divide  and  conquer" 

strategy  to  compute  S(x)  fast  and  then  show  how  to  compute  F1-^  from  (3.2) 
in  total  time  0(C0MP2(n)  + PCWER(q)). 

Although  we  wish  to  solve  the  functional  equation  (3.1),  to  make  the 
"divide  and  conquer"  strategy  work  we  embed  (3.1)  in  the  more  general  linear 
functional  equation 


(3.3)  A(x)W (F (x) ) - B(x)W(x)  - C(x)  - 0, 

where  W is  the  unknown.  Note  that  this  equation  Includes  reversion  as  a 
special  case.  The  "divide  and  conquer"  algorithm  introduced  to  solve  (3.3) 


3.2 


may  therefore  be  used  to  revert  power  series.  This  algorithm  is  different 
from  the  one  derived  by  Newton  iteration  and  given  in  Brent  and  Rung  [761. 

Lemma  3.1  gives  the  basis  for  a "divide  and  conquer"  algorithm  for  solving 

(3.3) .  The  proof  is  by  substitution.  Lemma  3.2  gives  sufficient  conditions 
for  the  existence  of  a formal  solution,  and  Lemma  3.3  gives  an  upper  bound 
on  the  time  required  to  compute  an  approximate  solution. 

Lemma  3 . 1 

If  n,  p are  non-negative  integers,  ord(F)  S 1, 

(3.4)  A(x)U  (F  (x) ) - B(x)U(x)  - C(x)  =■  x”r(x) 
and 

(3.5)  A(x) (F(x)/x)nV(F(x))  - B(x)V(x)  + R(x)  - 0(xP), 
then 

(3.6)  A(x)W(F (x) ) - B(x)W (x)  - C(x)  - 0(xn+P) 
where 

(3.7)  W (x)  - U (x)  + xnV(x).  ■ 

Remark  3 . 1 

If  Lemma  3.1  is  applied  for  n * p * 2^,  J ■ 0,1,2,...,  we  have  an  algo- 
rithm for  approximating  W(x)  which  is  quadratically  convergent  in  the  sense 
of  Rung  and  Traub  [76].  ■ 

Lemma  3 . 2 

If  ord(F)  i 1, 

(3.8)  aQf^  ^ bQ  for  all  m ■ 1,2,3,... 


3.3 


and 

(3.9)  aQ  - bQ  implies  cQ  - 0 


then  there  is  a formal  power  series  W,  satisfying  (3.3),  with  orc(W)  ■ 0 unless 

c0  " 0 and  a0  + V 


Proof.  * 

We  shall  construct  w^.w^,...  such  that  W(x)  ■ ^ w^x^  satisfies  (3.3). 

, j"0 

We  let  J 

m 

(3.10)  Wa(x)  - £ wjxJ 

J-o 

and  show  by  induction  on  m that,  for  some  power  series  (x) , 

(3.11)  A(x)Wn(F(x))  - B(x)Wm(x)  - C (x)  - x^R^fr)  - 0(x°H‘1)  . 


Let 


(3.12)  w 


1 if  a0  “ b0 
c0/(aQ-b0)  otherwise 


Then  (3.11)  holds  for  m ■ 0,  starting  the  induction, 
holds  for  m 2 0,  we  define 


Assuming  that  (3.11) 


(3.13)  w 


W0) 


nH-1 


V*ofi 


m+1 


and  apply  Lemma  3.1  (with  n ■ bh-1,  p**l,U*W,V"w  .)  to  deduce  that 

01  WtrL 

(3.11)  holds  with  m replaced  by  m-1.  Thus,  the  result  follows  by  induction 


on  m. 


3.4 
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Lemma  3.3 

Suppose  that  Wg,...,wn  ^ can  be  found  in  time  t(n)  whenever  the  condi- 
tions of  Lemma  3.2  apply.  Then 

(3.14)  t(2n)  * 2t(n)  + C0MP(2n)  + 0(MULT(n)). 


Proof 

In  time  t(n)  we  find  u„,...,u  . such  that  (3.4)  holds  for  some  power 

u n-  l 

n-1 

Cl  2n 

series  R(x) , where  U(x)  ■ ^ ujx  • Compute  U(F(x))  mod  x in  time  CCMP(2n) , 
and  then  find  ^ ® 


(3.15)  R(x) 


mod  x 


n 


in  time  0(MULT(n) ) . [Note:  MULT (2n)  ■ 0 (MULT (n) ) . ] 

Since  ord(F)  i 1,  F(x)/x  is  a power  series,  and  by  an  algorithm  given 
in  Brent  [76]  we  can  compute  (F(x)/x)n  mod  xn,  and  thus 

(3.16)  A(x)  * A (x) (F (x)/x)°  mod  x*, 

in  time  0(MULT(n)).  Now  (3.5)  with  p ■ n is  just 

X(x) V(F (x) ) - B(x)V(x)  + R(x)  - 0(x°)  , 

so  we  can  find  vQ****»va_i  111  time  t(n).  Using  Lemma  3.1,  we  take 

if  0 S j < n 
if  n i ] < 2n 

and  the  result  follows.  ■ 
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Corollary  3.1 

With  Che  notation  of  Lemma  3.3, 

(3.17)  t(n)  - 0(C0MP2(n)). 

Proof 

This  follows  from  Lemma  3.3,  the  definition  of  COHP^Cn),  and  the  fact 
that  MULT(n)  - 0(C0MP(n)).  ■ 

Corollary  3.2 

If  ord(F)  * 1 and  f1^  ^ 1 for  m ■ 1,2,...,  then  we  can  compute  the  first  n 

coefficients,  Sg sn-l  t*ie  Schroeder  function  S(x)  satisfying  (3.1)  in 

time  0(C0MP2(n)). 

Proof 

We  solve  a special  case  of  (3.3),  namely 

(3.18)  (F (x)/ x)W(F (x) ) - fjW(x)  - 0, 

to  obtain  Wq,...,wq_2  by  the  method  of  Lemma  3.3.  Then  S(x)  * xW(x) 
satisfies  (3.1)  mod  xQ,  so  s^  ■ 0 and  Sj  • for  J “ 1 ■ 

Theorem  3 . 1 

Assume  ord(F)  ■ 1,  f^  ^ 1 for  a ■ 1,2,...  . Let  fq  be  defined  and  let 

(3.19)  G(x)  - F[q](x). 

Then  gn,...,g  . can  be  computed  in  time 

u n- 1 


(3.20)  0(C0MP2(n)  + PCWFR(q)) 


3.6 


Proof 

Using  the  method  of  Corollary  3.2,  we  compute  S , fx) 

n- 1 v 

that  s^  ^ 0 and 

(3.2D  Sq_1(F(x))  - f1Sn-1(x)  + 0(xn) 

in  time  0(C0MP2(n)).  Now 
(3-22)  Sn_1(G(x))  - fjsn-l(x)  + 0(xQ), 

and  thus 

(3.23)  G(x)  - SnIi](fiSn-l(x))  + 0(x^ • 

Using  the  method  of  Brent  and  Rung  [76],  we  can  compute  S^"|^(x)  mod  xD  in 

n- 1 

time  0(C0MP(n) ) - 0(CCMP2(n)),  and  the  gQ,...,g^_1  are  obtained  from  (3.23) 
in  time  COMP(n) . The  result  follows.  g 

Remark  3.2 

The  condition  f^  f 1 is  necessary  so  that  the  divisor  in  (3.13)  is  non- 
zero. Thus,  we  need  omly  assume  that  f^  1 for  m-1,2 n-2 . If  F is  a formal 

power  series  over  a finite  field  with  characteristic  p,  then  it  is  necessary 
to  assume  n * p.  g 

The  prccfs  above  are  constructive  and  give  the  following  two  algorithms. 


n- 1 

i 

3 ^ SjXJ  such 

J-l 
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Algorithm  3.1 

The  algorithm  ?$(A,B,C  ,F  ,W  ,m)  finds  w , such  that  W(x)  satisfies 

0 m- 1 

(3.3).  It  is  defined  recursively  by: 

If  m ■ 1 then  (use  equation  (3.12)  to  define  w^}  else 
(n  ♦-  Im/  2*1 ; 

4.(A,B,C,F,U,n); 

Compute  R using  equation  (3.15); 

Compute  7 using  equation  (3.16); 

^(S'.B.-R.F.V.n); 

For  J - 0 step  1 until  n-1  do  (w^  ~ Ujj  *"  v^}}.  ■ 

Algorithm  3.2 

The  following  algorithm  computes  G(x)  • F^(x)  if  the  conditions  of 
Theorem  3.1  apply: 

1.  Take  A(x)  ■ F(x)/x,  B(x)  ■ f . , C(x)  ■ 0 and  find  w.,...,v  such 

i u n-  l 

that  W(x)  satisfies  (3.3)  by  applying  ^(AtB,C,F,W,n-l)  (see  Algo- 
rithm 3.1). 

2.  Let  sQ  • 0,  Sj  ■ wj-i  for  J " l»*»*»a-l.  and  compute  S^*1^(f^S(x))  mod  xn 
using  the  composition  and  reversion  algorithms  of  Brent  and  Rung  [76]. 
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4.  MULTIPLIER  ZERO 

In  this  section  we  study  the  case  f^  « 0.  Since  the  problem  is  trivial 
if  F (x)  a 0,  we  can  assume  ord(F)  ■ k,  l<k<  •.  We  define  auxiliary  power 
series  S(x)  by 

(4.1)  S(F (x) ) - fk(S(x))k,ord(S)  - 1,  1. 

This  reduces  to  Schroeder' s equation  (3.1)  if  k ■ 1.  By  induction  on  q we 
have,  for  all  positive  integer  q, 

(4.2)  F^(x)  - S^-1^[f^q"1)/,(k"1)[S(x)  ]feq}. 

Remark  4 . 1 

The  restriction  to  positive  integer  q is  essential  here.  For  example, 
take  F * x3.  Then  F^  does  not  exist  as  a power  series  for  q ■ -1  or  q » 1/2.  9 
The  following  lemmas  reduce  the  solution  of  (4.1)  to  problems  solved  in 
the  previous  section. 

Lemma  4 . 1 

If  ord(F)  ■ k > 1 the  equation 

(4.3)  W(F (x) ) - kV(x)  + {(k-1)  + ln[F(x)/ (fkxk) J } - 0 

has  a solution  W(x)  , and  w„,...,w  . can  be  computed  in  time  0(C0MP„ (n) ) . 
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Proof 

Lemmas  3.1  to  3.3  are  applicable  to  (4.3),  so  W(x)  exists  and 
Wg,...,wn_^  can  be  computed  in  time  0(CCJMP2(n))  by  the  method  used  in  the 


proof  of  Lemma  3.3. 
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Lemma  4 . 2 

If  ord(F)  » k > 1 and  W(x)  satisfies  (4.3),  then 

(4.4)  S(x)  “x  exp(W(x)-l) 
satisfies  (4.1). 

Proof 

Substitute  W(x)  - 1 + ln(S(x)/x)  in  (4.3).  From  (3.12),  wQ  » 1,  so  S(x) 
is  a power  series . g 

Using  the  algorithm  of  Brent  [76]  we  can  compute  the  first  n coefficients 
of 

[S(x)/x]  - exp[kq(W(x)-l)] 

in  time  0(MULT(n))  once  Wg,...,wn_^  are  known.  We  can  also  compute  f^  - l]/(k- 1) 
in  time  POWER((k  -1)/ (k-1)).  Then,  using  a slight  modification  of  the  composi- 
tion and  reversion  algorithms  of  Brent  and  Rung  [76]  we  have: 

Theorem  4 . 1 

Assume  ord(F)  ■ k > 1,  q 2 1 is  i positive  integer,  and 

(4.5)  G(x)  - FCq](x)/xk<1. 

Then  ggf..,gn-1  can  be  computed  in  time 


(4.6) 


0(C0M?2(n)  + POWER ( (kq- 1)/  (k-1))). 
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5 . MULTIPLIER  UNITY 

Now  we  consider  the  case  that  the  multiplier  is  equal  to  unity.  We 
define  an  auxiliary  function  T by 

(5.1)  T (P  (x) ) * E'  (x)T(x)  , ord(T)  - ord(F(x)-x). 

T (x)  exists  and  is  unique  up  to  a scaling  factor  (Kuczma  [68,  Lemma  9.4]). 

Let  G(x)  ■ F^^(x).  Then  we  show  below  that  G(x)  may  be  computed  from  the  equation 

(5.2)  T(G(x))  - G'  (x)T(x)  . 

Remark  5 . 1 

T may  also  exist  if  f^  ^ 1.  If  F is  such  that  the  Schroeder  function  S 
exists,  then  T(x)  ■ cS(x)/s'(x),  where  c is  a nonzero  constant.  ■ 

Example  5.1 

If  F (x)  - 2x+x2,  then  S(x)  - ln(l+x) , T(x)  ■ (1+x) In (1+x) , F^(x)  - (1+x)  -l. 

2 

If  F (x)  ■ x/(l-x),  then  T(x)  » x . ■ 

Although  we  wish  to  solve  the  functional  equation  (5.1),  as  before  we 
need  to  embed  (5.1)  in  a more  general  equation.  Throughout  this  section  we 
define  d by  F (x)  » x+f^x*1  + ...,  f ^ 0,  and  let  k be  any  integer  greater 
than  d.  Then  we  shall  solve 

(5.3)  x1_d[(F(x)/x)kY(F(x))  - F'(x)Y(x)]  - A(x)  - 0 
for  Y(x) . 

Lemma  5.1  gives  the  basis  for  a "divide  and  conquer"  algorithm  for  solving 

(5.3) .  Lemna  5.2  gives  sufficient  conditions  for  the  existence  of  a formal  solution, 
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and  Lemma  5.3  gives  an  upper  bound  on  Che  time  required  to  compute  an 
approximate  solution.  Lemma  5.4  establishes  (5.2)  and  gives  a sufficient 
condition  for  G to  be  uniquely  defined. 

Lemma  5 . 1 

Let  n,  p be  non-negative  integers.  If 

(5.4)  x1‘d[(F(x)/x)kU(F(x))  - F' (x)U(x) ] - A(x)  » xnR(x) 
and 

(5.5)  x1‘d[(F(x)/x)k+tlV(F(x))  - F'  (x)  V(x)  ] + R(x)  - 0(xP)  , 
then 

(5.6)  x1'd[(F(x)/x)kW(F(x))  - Ff  (x)W(x)  ] - A(x)  - 0(xn+P) 
where 

(5.7)  W (x)  - U(x)  + xnV(x) 

Proof 

By  direct  substitution.  Note  that  since  F(x)  ■ x + fdxd  + ...,  the 
terms  in  square  brackets  in  (5.4)  to  (5.6)  have  ord  2 d-1.  ■ 

Lemma  5.2 

There  is  a formal  power  series  Y(x)  such  that 

(5.8)  x1"d[(F(x)/x)kY(F(x))  - F'(x)Y(x)]  - A(x) 

Proof  „ 

We  shall  construct  yQ,y1,...  such  that  Y(x)  - ^ y^xJ  satisfies  (5.3). 

1*0 

Recall  our  assumption  that  k > d ■ ord(F(x)-x).  Take 
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(5.9)  y 


0 (k-d)f , 


and  let 


(5.10)  Yn(x) 


Thus 


L yjx 
j-o 


(5.11)  x1’d[(F(x)/x)kYn_1(F(x)) 


- F'  (x)Yq_1(x)]  - A(x) 


xnRQ(x) 


is  true  for  n 
(5.12)  yn 


■ 1 (where  R 

-Rn<0) 

(k+n-d) f , 
a 


is 


some  power  series) . 


Define 


for  nil.  Using  Lemma  5.1  wita  p • 1,  it  is  straightforward  to  prove  that 
(5.11)  holds  for  all  n i 1,  by  induction  on  n.  Thus,  the  result  follows.  ■ 


Lemma  5 . 3 

Suppose  that  yg,...,yn_^  can  found  in  time  t2(n)  whenever  the  condi- 
tions of  Lemma  5.2  apply.  Then 

(5.13)  t2(2n)  £ 2t2(n)  + COMP(2n+d-l)  + 0(MULT(n)). 

Proof 

In  time  t2(n)  we  find  u^ un- 1 suc^  that  (5.4)  holds  for  some  power 

n-1 

series  R(x) , if  U(x)  - ^ u Compute  U(F(x))  mod  x2n+d-1  and  then 

n J“° 

R(x)mod  x from  (5.4) . Then  find  vg»*«-»vn_1  such  that  V(x)  satisfies  (5.5) 

with  p - n (this  takes  time  t2(n)  + 0(MULT(n))).  From  Lemma  5.1  we  can  take 

uj  if  0 £ j < n 

v if  n £ j < 2n 


1 


so  we  get  y0>-**>y2n-l  in  time  2t2(n)  + C0MP(2n+d-l)  + 0(MULT(n))  as 
required. 

Corollary  5.1 

With  the  notation  of  Lemma  5.3,  t2  (n)  ■ 0(C0MP2(n)). 

Corollary  5.2 

There  exists  a formal  power  series  T(x)  such  that  ord(T)  ■ d and 

(5.14)  T (F (x) ) - F' (x)T(x) . 

Moreover,  t.,...,t  , can  be  found  in  time  0(COMP„ (n) ) . 

a n-  i i. 

Proof 

(5.15)  If  A(x)  - x"2d[F'(x)xd  -(F(x))d]  - (f^-f^)  + ... 
and 

(5.16)  x1_d[(F(x)/x)d+1Y(F(x))  - F' (x)Y(x) ] - A(x) 
then 

d d+1 

(5.17)  T(x)  - x + x Y(x) 

satisfies  (5.14).  Thus,  the  result  follows  from  Lemma  5.2  and  Corollary 
5.1. 

Lemma  5.4 

Let  q be  an  Integer,  T satisfy  (5.14),  and 

(5.18)  G(x)  - F^(x) . 

Then 


(5.19)  T(G(x))  - G' (x)T(x) 
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and  the  power  series  G(x)  is  uniquely  determined  by  (5.19)  and  the  condition 
(5.20)  ord(G(x)  - x - qf^x**)  > 

Proof 

It  is  easy  to  prove  (5.19)  by  induction  for  positive  q,  and  the  result 
for  negative  q then  follows.  It  is  also  easy  to  prove  by  induction  that 
(5.20)  holds  if  G is  defined  by  (5.18).  From  Lemma  9.4  of  Kuczma  [68]  the 
solution  of  (5.19)  satisfying  (5.20)  is  unique,  so  the  result  follows.  ■ 

Once  T(x)  is  known,  we  can  solve  (5.19)  for  G(x) , using  the  "initial 
condition"  (5.20).  Since  (5.19)  is  a nonlinear  differential  equation  for  G, 
we  can  use  a Newton-type  method  as  described  in  3rent  and  Kung  [76].  The 
algorithms  are  given  below.  First  we  summarize  the  result: 

Theorem  5 . 1 

Assume  ^ * 1 and  let  G * F^(x).  Then  Sg’***^^  can  be  computed  in 
time  0(CCMP2(n)). 

Proof 

First  find  t^,...,^^  such  that  T(x)  satisfies  (5.1),  as  in  Corollary 
5.2,  in  time  0(C&iP2(n)).  Then  solve  (5.19)  and  (5.20)  by  Algorithm  5.3 
below  (in  time  0(C0MP(n))  to  find  gQ gQ  . g 

Remark  5.2 

Note  that  q need  not  be  an  integer  in  Theorem  5.1.  Kuczma  [68,  Theorem 
9.15]  considers  the  question  of  when  F^(x)  is  analytic.  See  also  Baker  [64] 
and  Szekeres  [64].  g 
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Algorithm  5.1 

The  algorithm  3i(A,F,Y,k,d,n)  finds  such  that  Y(n)  satisfies 

(5.8).  It  is  assumed  that  n > 0,  an,...,a  and  f f,  , are  given, 

u n- 1 1 a+n- 1 

and  that  the  conditions  stated  after  Example  5.1  are  satisfied.  25(A,F,Y,k,d,n) 
is  defined  recursively  by: 


If  n ■ 1 then  (define  y^  by  (5.9)} 
else  (p  *-  fn/ 2l ; 

<B(A,F,U,k,d,p); 

Compute  U(F(x))  mod  x'^P  S 

Compute  R (x)  mod  xP  from  (5.4)  with  n replaced  by  p; 


<8(-R,F,Vfk+p,d,p); 

For  j *-  0 step  1 until  j-1  do 


Cy 


j V Vi 


- V }}. 


Algorithm  5.2 

The  algorithm  5(F,T,d,n)  finds  t^,...,t  ^ such  that  T(x)  satisfies  (5.14). 

It  is  assumed  that  f^,...,fQ_^  are  given  and  that  the  conditions  of  Corollary 
5.2  are  satisfied. 


Y - 0;  T - 0;  td  - 1; 

if  n > d+1  then  (compute  A(x)  mod  x11  from  (5.15); 

S(A,F ,Y,d+l,d,n-d-l) ; 

For  J *•  d+1  until  n-1  do  t^  — ■ 
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Algorithm  5.3 

The  following  algorithm  computes  such  that  G(x)  * F^(x). 

It  is  assumed  that  have  been  computed  using  Algorithm  5.2. 


„ , d 

G *-  x + q f^x  ; 

k - 1; 


while  k+d  < n do 

(k  «-  min(2k,n-d)  ; 

d _ T(G(x))-G' (x)T(x) 
a 

x T (x) 


_ d T'  (G  (x) ) 
x T(x) 


V **  I*  E(y)R(y)dy 

G - G + xdV 


k-1 
mod  x ; 

, k-2 

mod  x ; 

, k-l 

mod  x ; 

, k 
mod  x ; 

, k+d. 
mod  x j . 


Remark  5 . 3 

It  can  be  verified  that  all  the  quantities  appearing  on  the  lefthand 
sides  in  Algorithm  5.3  are  indeed  power  series. 
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6.  MULTIPLIER  NONTRIVIAL  ROOT  OF  UNITY 

In  this  section  we  consider  the  only  remaining  case:  1,  ■ 1 for 

some  integer  m > 1.  By  Remark  3.2  we  may  assume  m £ n-2.  We  also  assume  q is  an 
integer. 

Remark  6 . 1 

The  restriction  to  integer  q is  essential  here.  For  example,  let 

F ■ -x  + x2  + x3.  There  is  no  formal  power  series  for  F^2^(x).  That  is, 

r2 1 

there  is  no  power  series  G(x)  such  that  G1-  J(x)  - F(x)  (Kuczma  [68,  p.  304}).  ■ 

In  what  follows  we  shall  use  the  following  algebraic  relations: 

(6.1)  FCfm,:,(x)  - F[p](FCq](x))  , 

(6.2)  F^P<I^(x)  ■ R^(x),  where  R(x)  ■ F^(x), 

for  integer  p,  q.  If  q is  negative  we  compose  F^-1"  instead  of  F,  so  without 
loss  of  generality  we  may  assume  that  q is  positive.  Let 

(6.3)  q ■ mr  + s, 

where  r 2 0,  0 s s < m.  We  can  evaluate  M * F^  ■ x + . ..,  and  F^Sj  by  the 
obvious  "squaring"  method  in  time  0(C0MP(n)  lg  m)  - 0(C0MP(n)  lg  n) . Then, 
using  the  method  of  Section  5,  we  can  evaluate  F^™^  * in  time 

0(C0MP2(n))«  Finally,  F^  - F^mr^(F^s^)  may  be  evaluated  by  performing  one 
composition.  (An  additional  reversion  is  required  if  q < 0.)  Thus  we  have 
established 

Theorem  6.1 

Assume  ord(F)  "1,  f^  ^ 1,  f1^  * 1 for  some  m such  that  1 < m £ n-2, 


i 


I 
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J , rqi 

q integer,  and  let  G a F . Then  g ,...,g  can  be  evaluated  in  time 

i n- 1 

0(C0MP(n)  lg  m + COMP2(n)).  q 

Remark  6.2 

If  A is  the  real  field  (so  the  only  roots  of  unity  are  + 1)  then  Theorem 
6.1  shows  that  g1,....,gQ_1  can  be  evaluated  in  time  0(C0MPo(n>).  q 
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7.  SUMMARY  AND  OPEN  PROBLEMS 

From  Theorems  3.1,  4.1,  5.1,  and  6.1  we  have 
Theorem  7 . 1 

Let  F (x)  be  a formal  power  series,  ord(F)  * 1,  and  let  G(x)  - F^(x). 

If  q satisfies  the  following  conditions: 

(i)  If  ord(F)  > 1,  then  q is  a positive  integer; 

(ii)  If  the  multiplier  f is  a nontrivial  root  of  unity,  then  q is 
an  integer; 

(iii)  f^  is  defined; 

and  if  f^  is  given,  then  gj,...,gQ  can  be  computed  in  time  0(C0MP2(n))  and 
this  bound  is  independent  of  q.  ® 

Different  defining  equations  are  used  for  the  various  cases  we  have  had 
to  consider.  For  the  reader's  convenience  we  summarize  them  here.  As  before, 

q m 

I.  Regular  case:  f^  ^ 0,  f^  ^ 1,  m ■ 1,2,...  . Define  S by 
S (F (x) ) - fjS(x),  ord(S)  * 1.  Then  G(x)  ■ ^(fjS(x)). 

II.  fx  ■ 0.  Define  S by  S(F(x))  ■ ffe(S(x))k,  ord(S)  ■ 1,  s1  • 1. 

1h«  G(x)  - St’1 II. III. IV.'  I1'’}  . 

III.  fj^  ■ 1.  Define  T by  T(F(x))  • F'(x)T(x),  and  ord(T)  ■ ord(F(x)-x). 
Then  determine  G(x)  from  T(G(x))  • G'(x)T(x). 

IV.  fj  1,  f^  ■ 1 for  some  integer  m > 1.  This  can  be  reduced  to 
case  III. 
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It  is  possible  to  compute  G using  the  same  functional  equation  for 
cases  I-I1I.  Define  U(x)  by 

(7.1)  U(F(x>)  - (x)  U(x),  ord  (U(x))  =•  ord  (F(x)-x). 

ord(F) 

U(x)  exists  and  is  unique  up  to  a scaling  factor.  In  fact,  in  cases 
I and  II  we  have 

(7.2)  U(x)  * c S(x)/S'(x), 

and  in  case  III  we  have  U(x)  ■ c'  T(x),  for  some  non-zero  constants 
c and  c'  . Also,  it  is  easy  to  prove  that  G satisfies 

(7.3)  U (G(x) ) ■ OLS*} u(x). 

[ord (F)  ]a- 

Although  a unified  treatment  of  cases  I-III  using  (7.1)  and  (7.3) 
would  be  possible,  it  is  simpler  to  use  the  Schroeder  function  S(x)  of 
(3.1)  in  case  I and  the  generalized  Schroeder  function  of  (4.1)  in  case 
II,  for  then  G is  given  explicitly  by  (3.23)  or  (4.2)  instead  of  implicitly 
as  a certain  solution  of  (7.3).  Also,  in  proving  properties  of  algorithms 
for  the  computation  of  G by  either  method,  it  is  natural  to  consider 
cases  I-III  separately. 

The  techniques  of  Sections  3 and  5 can  be  applied  to  far  more 
general  nonlinear  functional  equations.  We  shall  report  on  this 
elsewhere. 

To  conclude  we  list  some  open  problems  suggested  by  the  results  of  the  paper. 

1.  If  the  field  A is  such  that  MULT(n)  ■ 0(n  lg  n)  then  the  fastest  algo- 
rithm known  for  composition  is  0((n  lg  n)  ^ ),  No  nontrivial  lower  bound 


is  known 


I 

I, 
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a.  Is  composition  harder  than  multiplication?  (It  is  at  least  as  hard.) 

b.  Although  there  are  only  n inputs  and  n outputs,  the  best  upper 

3/2 

bound  known  is  0((n  Ig  a)  ' ).  This  is  comparable  to  matrix 

2 2 

multiplication  where  there  are  2n  inputs  and  n outputs  but  the 

Is?  7 

best  upper  bound  known  is  0(n  ).  Can  the  Brent- Rung  upper 

bound  be  reduced? 

c.  Is  a > 1 in  the  notation  of  (2.5)?  An  affirmative  answer  would 
show  that  COMP^Cn)  ■ 0(C0MP(n)). 

2.  Brent  and  Rung  [76]  showed  that,  for  the  reversion  problem  R(x)  * F^”^(x), 
the  complexity  of  computing  R^x)  is  0(C0MP(n)).  Consider  computing 
Rjj(xq)  a scalar  n.  This  problem  has  n inputs  and  one  output.  Brent 
and  Rung  [76]  showed  its  complexity  to  be  0(MULT(n)).  If  G(x)  » F^(x), 
what  is  the  complexity  of  computing  Gn(Xg)?  Is  it  less  than  the  complexity 
of  computing  GQ(x)? 

3.  What  are  the  numerical  properties  of  our  algorithms?  For  example,  we 
expect  the  computation  of  the  Schroeder  function  to  be  ill-conditioned 
if  f^  is  close  to  1 for  some  m £ n-2;  see  (3.13).  Cherry  [64]  discusses 
this  problem  in  conjunction  with  a problem  in  dynamical  systems. 

4.  What  are  Che  complexity  bounds  for  exact  arithmetic  over  the  rational  field? 


• s, » 
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